PlantTFDB
Plant Transcription Factor Database
v4.0
Previous version: v3.0
Transcription Factor Information
Basic Information
TF ID Sopim12g044410.0.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; asterids; lamiids; Solanales; Solanaceae; Solanoideae; Solaneae; Solanum; Lycopersicon
Family HD-ZIP
Protein Properties Length: 838 aa    MW: 92335.8 Da    pI: 6.5931
Description HD-ZIP family protein
Gene Model
Gene Model ID        Type    Source  Coding Sequence
Sopim12g044410.0.1   genome  CSHL    View CDS
Signature Domain
No.  Domain    Score  E-value  Start  End  HMM Start  HMM End
1    Homeobox  59.8   4.4e-19  16     74   3          57
                        --SS--HHHHHHHHHHHHHSSS--HHHHHHHHHHC....TS-HHHHHHHHHHHHHHHHC CS
            Homeobox  3 kRttftkeqleeLeelFeknrypsaeereeLAkkl....gLterqVkvWFqNrRakekk 57
                        k  ++t+eq+e+Le+l++++++ps  +r++L +++    +++ rq+kvWFqNrR +ek+
  Sopim12g044410.0.1 16 KYVRYTPEQVEALERLYHECPKPSSMRRQQLIRECpilsHIEPRQIKVWFQNRRCREKQ 74
                        5679*****************************************************97 PP

2    START     175.3  3.7e-55  160    367  2          204
                         HHHHHHHHHHHHHHHC-TT-EEEEEXCCTTEEEEEEESSS.SCEEEEEEEECCSCHHHHHHHHHCCCGGCT-TT-SEEEEEEEECTT..EE CS
               START   2 laeeaaqelvkkalaeepgWvkssesengdevlqkfeeskvdsgealrasgvvdmvlallveellddkeqWdetlakaetlevissg..ga 90 
                         +aee+++e+++ka+ ++  Wv+++ +++g++++ +++ s++++g a+ra+g+v  +++  v+e+l+d++ W ++++++e+l+ + ++  g+
  Sopim12g044410.0.1 160 IAEETLTEFLSKATGTAVEWVQMPGMKPGPDSIGIIAISHGCTGMAARACGLVGLDPT-RVAEILKDRPSWYRDCRAVEVLNMLPTAngGT 249
                         789*******************************************************.8888888888****************9999** PP

                         EEEEEEXXTTXX-SSX.EEEEEEEEEEE.TTS-EEEEEEEEE-TTS--....-TTSEE-EESSEEEEEEEECTCEEEEEEEE-EE--SSXX CS
               START  91 lqlmvaelqalsplvp.RdfvfvRyirqlgagdwvivdvSvdseqkppe...sssvvRaellpSgiliepksnghskvtwvehvdlkgrlp 177
                         ++l +++l+a+++l+p Rdf+ +Ry+    +g++v++++S+ ++q+ p+    +++vRae+lpSg+li+p+++g+s v++v+h++l+++++
  Sopim12g044410.0.1 250 IELLYMQLYAPTTLAPpRDFWLIRYTTVTDDGSFVVCERSLGNTQNGPSmpqVQNFVRAEMLPSGYLIRPCEGGGSIVHIVDHMNLEAWSV 340
                         ************************************************9998899************************************ PP

                         HHHHHHHHHHHHHHHHHHHHHHTXXXX CS
               START 178 hwllrslvksglaegaktwvatlqrqc 204
                         +++lr+l++s+++ ++kt++a+l++++
  Sopim12g044410.0.1 341 PEVLRPLYESSAVLAQKTTMAALRQLR 367
                         ***********************9986 PP

Protein Features
3D Structure
Database         Entry ID            E-value   Start  End  InterPro ID  Description
PROSITE profile  PS50071             15.742    11     75   IPR001356    Homeobox domain
SuperFamily      SSF46689            3.51E-17  13     77   IPR009057    Homeodomain-like
SMART            SM00389             2.3E-16   13     79   IPR001356    Homeobox domain
CDD              cd00086             8.52E-17  16     76   No hit       No description
Pfam             PF00046             1.1E-16   17     74   IPR001356    Homeobox domain
Gene3D           G3DSA:1.10.10.60    1.5E-18   18     74   IPR009057    Homeodomain-like
CDD              cd14686             1.10E-5   68     107  No hit       No description
PROSITE profile  PS50848             26.76     150    365  IPR002913    START domain
CDD              cd08875             2.25E-75  154    369  No hit       No description
SMART            SM00234             1.6E-40   159    369  IPR002913    START domain
Gene3D           G3DSA:3.30.530.20   1.9E-24   159    365  IPR023393    START-like domain
SuperFamily      SSF55961            2.75E-36  159    367  No hit       No description
Pfam             PF01852             1.6E-52   160    367  IPR002913    START domain
Pfam             PF08670             2.8E-50   694    836  IPR013978    MEKHLA
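The Start and End columns are 1-based, inclusive residue coordinates, so a feature's length is End - Start + 1. As a minimal sketch, the snippet below takes the three Pfam rows from the table, computes their lengths, and lists the stretches of the 838-residue protein that no Pfam hit covers. The `Feature` type and the helper names are hypothetical, introduced only for illustration; they are not part of PlantTFDB.

```python
from typing import List, NamedTuple, Tuple


class Feature(NamedTuple):
    """One row of the Protein Features table (hypothetical helper type)."""
    entry_id: str
    start: int  # 1-based, inclusive
    end: int    # 1-based, inclusive
    description: str


# The three Pfam rows as read from the table above.
PFAM_HITS = [
    Feature("PF00046", 17, 74, "Homeobox domain"),
    Feature("PF01852", 160, 367, "START domain"),
    Feature("PF08670", 694, 836, "MEKHLA"),
]

PROTEIN_LENGTH = 838  # from the Protein Properties line


def feature_length(feature: Feature) -> int:
    """Length in residues for inclusive coordinates."""
    return feature.end - feature.start + 1


def uncovered_regions(features: List[Feature], protein_length: int) -> List[Tuple[int, int]]:
    """Return 1-based inclusive regions not covered by any feature."""
    regions = []
    cursor = 1  # first residue not yet accounted for
    for feature in sorted(features, key=lambda f: f.start):
        if feature.start > cursor:
            regions.append((cursor, feature.start - 1))
        cursor = max(cursor, feature.end + 1)
    if cursor <= protein_length:
        regions.append((cursor, protein_length))
    return regions


lengths = {f.entry_id: feature_length(f) for f in PFAM_HITS}
gaps = uncovered_regions(PFAM_HITS, PROTEIN_LENGTH)
```

The long uncovered stretch between the START and MEKHLA hits only means that no Pfam model in this table matched there; it says nothing about whether the region is structured.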
Gene Ontology
GO Term     GO Category         GO Description
GO:0003677  Molecular Function  DNA binding
GO:0008289  Molecular Function  lipid binding
Sequence
Protein Sequence    Length: 838 aa
MSMSCKDGKS IEDNGKYVRY TPEQVEALER LYHECPKPSS MRRQQLIREC PILSHIEPRQ  60
IKVWFQNRRC REKQRKESSR LQGVNRKLSA MNKLLMEEND RLQKQVSQLV YENGYFRKQT  120
QTTKLASKDT SCESVVTSGQ HHLTPQHPPR DASPAGLLSI AEETLTEFLS KATGTAVEWV  180
QMPGMKPGPD SIGIIAISHG CTGMAARACG LVGLDPTRVA EILKDRPSWY RDCRAVEVLN  240
MLPTANGGTI ELLYMQLYAP TTLAPPRDFW LIRYTTVTDD GSFVVCERSL GNTQNGPSMP  300
QVQNFVRAEM LPSGYLIRPC EGGGSIVHIV DHMNLEAWSV PEVLRPLYES SAVLAQKTTM  360
AALRQLRQLT LEVSQPNVTN WGRRPAALRA LSKRLNRGFN EALNGFSSEG WSMLDNDGMD  420
DVTILVNSSP DKLMGLNLSF SDGFTSLSNA VLCAKASMLL QSVTPATLLR FLREHRSEWV  480
DNNIDAYSAA AVKVGPCSLP GVRVSNFGGQ VILPLAHTVE HEELLEVIKL EGVCHSPEDV  540
IMPRDMFLLQ LCSGMDENAV GTCAELVFAP IDASFADDTP LLPSGFRIIP LDSAKEASSP  600
NRTLDLTSAL ETGPVGSKVA NDLKSTGGTS KSIMTIAFQF AFESHMQENV ASMARKYVRS  660
FISSVQRVAL ALSPSNFGSL GGLRLPLGTP EAHTLARWIC QSYRRFLGVE LPKLSSEGSE  720
SLLDSLWHHS DAIICCSAKA LPVFTFANQG GLDMLETTLV ALQDISLEKI FDEHGRKNLC  780
SEFPQIMQQG FACLQGGICL SSMGRPVSYE KAVAWKVLNE EDTAHCIGFM FVNWSFV*
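The sequence is displayed in groups of ten residues with a running position counter at the end of each line and a terminal `*`. Before slicing out a domain it helps to collapse the block into a plain string. The sketch below does that for the first two lines of the block and then extracts the Homeobox region using the 1-based, inclusive Start and End values (16 and 74) from the Signature Domain table. The function names are hypothetical and chosen only for illustration.

```python
import re


def clean_sequence_block(block: str) -> str:
    """Collapse a numbered, space-grouped sequence block into one string.

    Keeps letters only, which drops spaces, newlines, the position
    counters at the end of each line, and a trailing '*'.
    """
    return "".join(re.findall(r"[A-Za-z]", block)).upper()


def domain_slice(seq: str, start: int, end: int) -> str:
    """Return residues start..end for 1-based, inclusive coordinates."""
    if not (1 <= start <= end <= len(seq)):
        raise ValueError(f"coordinates {start}-{end} fall outside 1-{len(seq)}")
    return seq[start - 1:end]


# First two lines of the sequence block above (residues 1-120).
block = """
MSMSCKDGKS IEDNGKYVRY TPEQVEALER LYHECPKPSS MRRQQLIREC PILSHIEPRQ  60
IKVWFQNRRC REKQRKESSR LQGVNRKLSA MNKLLMEEND RLQKQVSQLV YENGYFRKQT  120
"""

seq = clean_sequence_block(block)
homeobox = domain_slice(seq, 16, 74)
```

The extracted residues can be compared against the query row of the Homeobox alignment, where lowercase letters in the alignment mark insertions relative to the HMM.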
Annotation -- Protein
Source     Hit ID              E-value  Description
Refseq     XP_004252354.1      0.0      PREDICTED: homeobox-leucine zipper protein ATHB-15
Swissprot  Q9ZU11              0.0      ATB15_ARATH; Homeobox-leucine zipper protein ATHB-15
TrEMBL     K4DFC9              0.0      K4DFC9_SOLLC; Uncharacterized protein
STRING     Solyc12g044410.1.1  0.0      (Solanum lycopersicum)
Orthologous Group
Lineage   Orthologous Group ID  Taxa Number  Gene Number
Asterids  OGEA45724140
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT1G52150.1  0.0      HD-ZIP family protein